Improving summarization performance by sentence compression: a pilot study
نویسنده
چکیده
In this paper we study the effectiveness of applying sentence compression on an extraction based multi-document summarization system. Our results show that pure syntactic-based compression does not improve system performance. Topic signature-based reranking of compressed sentences does not help much either. However reranking using an oracle showed a significant improvement remains possible.
منابع مشابه
Improving Quality of Vietnamese Text Summarization Based on Sentence Compression
Sentence compression is a valuable task in the framework of text summarization. In previous works, the sentence is reduced by removing redundant words or phrases from original sentence and tries to remain information. In this paper, we propose a new method that used Grid Model and dynamic programming to calculate n-grams for generating the best sentence compression. These reduced sentences are ...
متن کاملCitation Handling: Processing Citation Texts in Scientific Documents
Title of thesis: CITATION HANDLING: PROCESSING CITATION TEXTS IN SCIENTIFIC DOCUMENTS Michael Alan Whidby Master of Science, 2012 Thesis directed by: Professor Bonnie Dorr Dr. David Zajic Department of Computer Science Citation sentences (sentences that cite other papers) play a key role in the summarization of scientific articles. However, a citation-based summarization system that depends on ...
متن کاملImproving Multi-documents Summarization by Sentence Compression based on Expanded Constituent Parse Trees
In this paper, we focus on the problem of using sentence compression techniques to improve multi-document summarization. We propose an innovative sentence compression method by considering every node in the constituent parse tree and deciding its status – remove or retain. Integer liner programming with discriminative training is used to solve the problem. Under this model, we incorporate vario...
متن کاملUsing Coreference Links and Sentence Compression in Graph-based Summarization
Recent years have shown that graphs are an adequate text representation model for summarization. For this years’ TAC update summarization challenge, we extended our graph-based summarization system with coreference relations and sentence compression. Our results show that using coreference relations did not result in a significant performance gain; sentence compression had a negative effect on ...
متن کاملThe Potential And Limitations Of Automatic Sentence Extraction For Summarization
In this paper we present an empirical study of the potential and limitation of sentence extraction in text summarization. Our results show that the single document generic summarization task as defined in DUC 2001 needs to be carefully refocused as reflected in the low inter-human agreement at 100-word 1 (0.40 score) and high upper bound at full text 2 (0.88) summaries. For 100-word summaries, ...
متن کامل